The next generation of literature analysis: Integration of genomic analysis into text mining
نویسندگان
چکیده
Text-mining systems are indispensable tools to reduce the increasing flux of information in scientific literature to topics pertinent to a particular interest in focus. Most of the scientific literature is published as unstructured free text, complicating the development of data processing tools, which rely on structured information. To overcome the problems of free text analysis, structured, hand-curated information derived from literature is integrated in text-mining systems to improve precision and recall. In this paper several text-mining approaches are reviewed and the next step in development of text-mining systems, which is based on a concept of multiple lines of evidence, is described: results from literature analysis are combined with evidence from experiments and genome analysis to improve the accuracy of results and to generate additional knowledge beyond what is known solely from literature.
منابع مشابه
A review of text mining approaches and their function in discovering and extracting a topic
Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling. Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...
متن کاملCrime pattern analysis through text mining
Today, new data and text mining technologies provide a next generation of tools for the analysis and visualization of both structured data and text. Such tools help increase the quality and productivity of the analysis and reduce the latency period between recording raw data and obtaining key knowledge necessary for making informed decisions.
متن کاملThe analysis of the relationship between Lorestan cave barbs (Garra typhlops and Garra lorestanensis) and Garra gymnothorax populations in Dez and Karkheh River drainages
The cave barb habitat is located in a Karst formation along the Sezar River. The springs on the walls of the Sezar River valley may provide a means for fish in surface waters to penetrate into the underground waters. These observations propose the probability for a migratory relationship between Garra gymnothorax in the Sezar River and the cave barbs (Garra typhlops and Garra lorestanensis). In...
متن کاملDesigning a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms
Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...
متن کاملI-37: Establishing High Resolution Genomic Profiles of Single Cells Using Microarray and Next-Generation Sequencing Technologies
The nature and pace of genome mutation is largely unknown. Standard methods to investigate DNA-mutation rely on arraying or sequencing DNA from a population of cells, hence the genetic composition of individual cells is lost and de novo mutation in cell(s) is concealed within the bulk signal. We developed methods based on (SNP-) arraying and next-generation sequencing of single-cell whole-genom...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Briefings in bioinformatics
دوره 6 3 شماره
صفحات -
تاریخ انتشار 2005